Solvable senescence model with positive mutations 
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We build upon our previous analytical results for the Penna model of senescence to include positive 
mutations. We investigate whether a small but non-zero positive mutation rate gives qualitatively 
different results to the traditional Penna model in which no positive mutations are considered. We 
find that the high-lifespan tail of the distribution is radically changed in structure, but that there 
is not much effect on the bulk of the population. The mortality plateau that we found previously 
for a stochastic generalization of the Penna model is stable to a small positive mutation rate. 

PACS numbers: 87.23.-n, 87.10. +c 



I. INTRODUCTION 

The Penna model, introduced by Penna in 1995 
has been extensively studied through simulation 0, 
It models senescence in an asexually reproducing pop- 
ulation, with the lifetime of an individual encoded in a 
simple bit-string model of its genome. During reproduc- 
tion the bit-string is allowed to copy and mutate, so that 
the population evolves a distribution of genetic lifespans. 
The model is powerful enough to reproduce some aspects 
of observed real-life behaviour, and adaptable enough to 
allow modifications aimed at greater realism 0, |j| . 

One shortcoming of the standard Penna model is that 
a mutated genome is always worse than its parent 0. 
While this is a reasonable assumption, given that the rate 
of beneficial mutations is small compared to that of harm- 
ful ones, very little work has been carried out on Penna 
models with positive mutations |(|. Here we develop an 
analytical solution to a Penna model with a small rate 
of positive mutation. Our analysis builds upon previous 
work where a solution to a generalized class of stochastic 
Penna models was presented Q • All populations are con- 
sidered in the thermodynamic limit of a large population 
where statistical fluctuations and effects of discretization 
can be ignored. The exact limiting behavior found below 
gives insight into any population large enough for these 
two effects to be unimportant. 

In the standard (deterministic) Penna model, the bit- 
string determines the lifespan of an organism directly. 
One way (but not the only way [8|) to interpret this is 
to say that there is a set of hereditable diseases, which 
strike the organism at a set of fixed ages during its life- 
time. Once it has developed a given number of these, the 
organism dies. Although the age of death is thus pro- 
grammed from birth, an individual is in effect reading 
the bit string sequentially through its life; it drops dead 
after encountering the specified number of defective bits. 
(We call these Is; the rest are 0s.) So long as it lives, 
the organism produces offspring at a steady rate; these 
inherit the parental genome, with a small rate m of harm- 
ful mutation (0 — > 1) and, in this paper, an even smaller 



rate a for positive mutation, 1 — ► 0. 

The simplest of all Penna models is that in which an 
organism dies after developing a single disease. For clar- 
ity of argument we confine our discussion to this version 
of the Penna model in Sections |IIIIIII However, the anal- 
ysis of positive mutations that we present here can be 
adapted to any of the more sophisticated Penna model 
variants we have solved In Section HVI we address a 
stochastic variant showing a mortality plateau ■ 



II. A SIMPLE PENNA MODEL 

We consider first the single-disease simple Penna 
model, without positive mutation. The sites along the 
bit-string are numbered x = 0,1,2,....; an organism in 
whose bit-string the first 1 occurs on site I will live for 
exactly I timesteps and thus has 'genetic lifespan' I. Note 
that an organism with I = (i.e., a 1 at site x = in 
the string) will die instantly and never contributes to the 
population. Note also that x can be thought of as the 
age of an organism; so long as x < I, it remains alive. 

The number of organisms with age x and genetic lifes- 
pan I at timestep j is denoted by rij{x,l). Newborn or- 
ganisms with genetic lifespan I can be produced cither 
as copies of organisms with the same Z, or as mutated 
copies of organisms of lifespan /' > I. In both cases, sites 
0..J — 1 must go unmutated. These dynamics give the 
following discrete evolution equation 

oo 

n J+ i(0,0 = be-Pt^T nj{x, I) (1) 

x=0 

oo oo 

rij(x, I') 



V>1 x=0 

where — 1 — m, with m the (small) mutation rate and 
b the birth rate. The sum over ages of n{x, I) is defined 
to be n(l) and can be evaluated at steady state to give 
n(l) — ln(0,l). At steady state, the size of any part of 
the population is time-independent. Manipulation of 
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then gives a recursion relation for the relative sizes of 
sub-populations at steady state: 



n(l+ 1) _ 1 + 1 



°f l - bl 



I e W +1 ) - b(Z + l)e"^ ' 



(2) 



For the population to remain finite there must be a 
maximum sustainable gentic lifespan l max ; for / > i max , 
n{l) = (see Section Till Bl for further discussion). Any 
sub-population with a smaller genetic lifespan is partly 
reliant on a flux, by mutation, from longer lived sub- 
populations. These two conditions give restrictions on 
the choice of l maXl Ji and (3, as explained, and confirmed 
by simulation, in pj: 



max ^ 

b = 



1 



1 - e-P ' 
1 



(3) 
(4) 



III. SOLUTION WITH POSITIVE MUTATION 

We now introduce a small positive mutation rate a into 
the simple one-disease Penna model just described. Just 
as a harmful mutation converts a into a 1, a positive 
mutation rate a converts a 1 into a with probability 
a. The rate a is taken to be sufficiently small that there 
is no chance of multiple positive mutations occuring on 
the same organism. Further, we assume that a is small 
compared to the rate (3 of harmful mutations. This en- 
sures that after the first 1 in an organism's bit string, the 
remaining bits are, to high accuracy, all l's due to the 
accumulated effects of harmful mutations. In the absence 
of positive mutation this is clearly the the accu- 

mulation of harmful mutations is irreversible and there is 
no evolutionary pressure for an organism to have healthy 
sites on its bit-string beyond the site x — I. Positive 
mutations allow this accumulation to be reversed, but so 
long as a <C /3, there remains no evolutionary pressure 
on sites after the first 1; an organism's bit-string can thus 
be taken to consist of entirely l's after the first 1. This 
is a very strong condition, and in steady state it allows 
(as in Section ^) any bit-string to be characterized by a 
single number, the genetic lifespan /. 



A. Dynamical equations 

When considering mutations on an offspring, we im- 
pose that positive mutations take place first, then nega- 
tive mutations. This is to further enforce the weak nature 
of positive mutations, preventing them from overriding 
harmful mutations in the same time-step. 

Introducing the positive mutation rate a then gives a 



slightly modified equation for rij(0, 1) in place of Eq.Q: 

oo 

n j+1 {0,l) = (l-a^e-^^rijixj) (5) 

oo 

+ abe~ 0l ^nj(x,l - 1) 

x=0 

oo oo 

+ (l-a)mbe- 0l J2J2 n i( x > 1 '^ 

1>>1 x=0 
oo oo 

+ ambe- pl ^^ nj {x,l"). 

l">l x=0 

The first term on the right corresponds to organisms 
with genetic lifespan I reproducing with no harmful mu- 
tations at sites x < I and no positive mutation at x = I. 
(Note that there are I sites with x < I; the factor e _/3/ is 
the probability of no harmful mutation at any of these.) 
The second term corresponds to offspring from organisms 
with lifespan I— 1 with one positive mutation (at site I — 1) 
and no harmful mutations at sites x < I. The third term 
gives mutated offspring from longer lived organisms (of 
lifespan I' > I) without positive mutation, without harm- 
ful mutation for sites x < I, but with harmful mutation 
at site x = I. The final term gives mutated offspring 
from longer lived organisms of lifespan I" where positive 
mutation occurs at site x = I" but is negated by harmful 
mutation at Z, with no harmful mutation for sites x < I. 

Defining n(l) = ~^2 X n(x,l) as the total number of in- 
dividuals of lifespan I, in the steady state we find 



{l-^be-* 31 +ambe- til )n(l) (6) 
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+ abe-P l n(l - 1) + m&e^ ^ n{V) 



v>i 



Writing a similar expression for n(l + 1) allows construc- 
tion of a recursion relationship between n(l), n(l — l) and 
n(l + 1). (Note that n(0) is known to be zero.) This can 
be written, for given (3, b and a, as: 



n(l+l) 



l + l 



1 



(7) 



ju _ (1 



-P)bl]n{l)-abln{l-l) 



e /3(i+i) — (I — oi)b(l + l)e — ^ 
Note that if a is set to zero, Eq.@ is recovered. 



B. Subtleties at large I 

Deducing boundary conditions at large I, in the pres- 
ence of positive muations, is non-trivial. The possibility 
that an infinitely long lived organism can evolve from 
a population of shorter genetic lifespans means that no 
part of the population is uniquely self sustaining. This 
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is in contrast to the simple Penna model (a — 0) where 
the maximum sustainable genetic lifespan Z max remains 
finite in the thermodynamic limit of a large population; 
individuals of I — Z max reproduce unmutated offspring at 
a rate that precisely balances their own death rate. With 
positive mutation present this cannot be true. Organisms 
of putative maximal lifespan ^ max can produce positively 
mutated offspring with a longer lifespan, and in a large 
enough population there can be no Z max . 

Instead we insist that, for a steady state distribution 
to be physical, the population must be finite (with the 
thermodynamic limit taken only at the end). The effects 
of this requirement can be seen by taking the limit of an 
arbitrarily small birth rate b When taking this limit we 
do not apply the steady state conditions ll.'il II) which hold 
for the simple Penna model only. In the limit of a vanish- 
ing birth rate, both the simple Penna model and Penna 
model with positive mutations return a recursion rela- 
tion for the relative sizes of successive sub-populations 
as follows: 



7l(l + 1) _ I + 1 

n(l) ~ ~T~' 



-0 



(8) 



This limiting expression is independent of the birth rate 
so that, as b tends to zero the population can remain 
non-zero, with a distribution that is solely dependent on 
(3. This non-intuitive result can be explained by the be- 
haviour of organisms with infinite /. An infinitely long- 
lived organism can reproduce during its lifetime despite 
an arbitrarily small birth rate b. Moreover, in the limit 
of b — > 0, the mutated offspring of this 'super organism' 
make up the entire population. However, a population 
with even one super organism cannot be finite since, ac- 
cording to (JHJ, n(l — » oo)/n(l) = 0. It is clear, therefore, 
that steady state solutions of the recursion Q involving 
the presence of a super organism are not physical, and 
should be discarded. 

To summarize, in the simple Penna model of Section 
[DJthc requirement of a finite population directly imposes 
au Z max . When positive mutations are allowed there is 
no i maX ) but the population is still required to be finite. 
Thus the steady state must not contain super organisms 
if it is to represent a physical population, and the ther- 
modynamic limit must be taken so as to exclude them. 



C. Results 

An acceptable steady state can be found by imposing 
an artificial maximum genetic lifespan l c beyond which 
n(l) is taken to be zero. With this artificial cut-off, for 
specified a and 0, b can be found so as to satisfy the 
steady state conditions. As l c approaches infinity the 
difference between this approximate steady state and the 
real steady state will vanish for a sufficiently large popu- 
lation. The convergence of b with increasing l c is shown 
in figH We require that at l c the values of n(l c ) and 
n(l c — 1) predicted by the positive mutation recursion 
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FIG. 1: Birth rate b is plotted against the trucated genetic 
lifepan l c for a = O.OOf (x) and a = 0.005 (+). In both cases 
(3 = j^j. Genetic lifespan is measured in timesteps. 
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FIG. 2: Lifespan distributions for a Penna model with positive 
mutations in which a — 0.005 (x) and a = 0.001 (+) and 
without positive mutations (0). In each case (3 = gg. Genetic 
lifespan is measured in timesteps. 



relation satisfy to high order 

n(l c ) 



able 



i(l c - 1) el 31 - - ambl c - (1 - a)bl c 



(9) 



This ensures that contributions from longer lived organ- 
isms vanish, preventing the existence of a super organism 
as l c tends to infinity. This procedure resolves the para- 
doxes associated with the thermodynamic limit that were 
set out above, and allows us to find all the population 
properties from the recursion derived earlier. 

Analytical results, found by this method, for a Penna 
model with a small but non-zero a are shown in fig [21 
The population distributions with and without positive 
mutations are not dissimilar, but there are crucial dif- 
ferences. As discussed, with positive mutations allowed 
there is no maximum genetic lifespan. What is observed 
instead is a small but non-zero population beyond what 
would be Z max in the simple Penna model (a = 0). The 
remaining distribution closely resembles a = 0, with Z max 
taking its largest allowed value for the specified /3. 
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FIG. 3: Mortality rates for a Penna model with positive muta- 
tions and Fermi like survivability function in which w = 0.12, 
a — 0.005 (x) and a = 0.001 (+) and without positive muta- 
tions (©). In each case /? = -L. Age is measured in timesteps. 

Note that, depending on the size and history of the 
population, the maximum lifespan of a simple Penna pop- 
ulation may be less than the largest permissible value of 
l max 0, due to the effect of Muller's rachet ^ n other 
words, if a fluctuation within a finite population causes 
the self-sustaining sub-population of longest-lived indi- 
viduals to die off, they can never return and the max- 
imum lifespan is permanently reduced. With positive 
mutations this loss of fitness due to statistical fluctua- 
tions is reversible; positive mutations can act to restore 
the mean fitness of a population which has fallen below 
the maximum permitted by the simple Penna model. 

IV. EFFECT ON MORTALITY PLATEAU 

In we demonstrate that a Penna model with a mod- 
ified survival function can exhibit a mortality plateau at 
advanced ages 0,^3)- The strict deterministic nature 
of the original Penna model is relaxed; organisms with 
genetic lifespan I do not necessarily die at age I, though 
this is their expected lifespan. The 'step function' sur- 
vival up to age I of the simple Penna model is softened to 
a Fermi-like function whose width parameter w controls 



the genetic indeterminacy in the age of death. 

We observe that for a small positive mutation rate the 
mortality plateau is preserved (see figEJ). This plateau is 
the result of organisms living beyond I 'on borrowed time' 
Q. Even with a > 0, the majority of the organisms at 
advanced ages are rare survivors to ages far beyond I; and 
the mortality of these survivors is almost constant. The 
variation in mortality rates as a increases comes from 
the increased fraction of organisms with longer genetic 
lifespans (see figHJ). Since a positive mutation rate acts 
to increase the average lifespan of the population a cor- 
responding decrease in mortality is to be expected. 
V. CONCLUSION 

The nature of the positive mutations we have consid- 
ered differs from that investigated by Oliveira et al [6( in 
that it does not increase the mean fitness of the popu- 
lation over time. Our small positive mutation rate is a 
relatively weak effect and while it is able to restore the 
mean fitness of a population if pushed away from steady 
state (offering limited protection from Muller's ratchet) 
it cannot improve the population's fitness in a sustained 
manner. Strong positive mutations, such as those con- 
sidered by Oliveira et al, are capable of sustained im- 
provement in the fitness of a population but will take 
place over a far greater timescale than the weak positive 
mutations we have considered. 

Our analysis is limited to a small positive mutation 
rate. If the positive mutation rate were to become compa- 
rable to the negative one 0IH1 (it must remain smaller 
for there to be a possible steady state), then the assump- 
tion that there is no evolutionary pressure on bits after 
the terminal one would become invalid and our descrip- 
tion of the Penna string would break down. However 
it has long been accepted that positive mutation is ex- 
tremely rare compared to harmful mutation; as such this 
work addresses the relevant regime. 
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